CNDB-12553: ensure that memtable is reclaimed even when notification subscribers throw #1545

jakubzytka · 2025-02-04T16:50:04Z

What is the issue

Cassandra doesn't properly handle notification subscribers that throw
and thereby fail a flush. In such a case the flush is interrupted (despite
multiple uses of exception-safe code and exception accumulation)
after the sstable creation transaction has committed, but before the
memtable has been reclaimed. As a result, the memtable allocator believes
that more and more memory is in use or being reclaimed, which eventually
stops writes due to an apparent lack of memory in the memtable.

What does this PR fix and why was it fixed

This patch changes memtable flushing behaviour so that the memtable
is reclaimed iff it has been removed from the View, regardless
of whether the flush fails or not.
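
A minimal sketch of the intended invariant, not the actual patch. The types `FakeView` and `FakeMemtable` and the method `replaceFlushedAndReclaim` are hypothetical stand-ins rather than Cassandra classes. The sketch places reclamation in a `finally` block and guards it by whether the memtable was actually removed from the view, so a throwing subscriber cannot skip it.

```java
import java.util.ArrayList;
import java.util.List;

// Illustration only: hypothetical stand-ins, not Cassandra's real classes.
final class FlushReclaimSketch
{
    /** Memtable stand-in that only tracks whether its memory was released. */
    static final class FakeMemtable
    {
        boolean reclaimed = false;
    }

    /** View stand-in: the list of memtables currently tracked. */
    static final class FakeView
    {
        final List<FakeMemtable> memtables = new ArrayList<>();
    }

    /**
     * Removes the memtable from the view, then runs the subscriber notification.
     * The memtable is reclaimed if and only if it was removed from the view,
     * whether or not the notification throws.
     */
    static void replaceFlushedAndReclaim(FakeView view, FakeMemtable memtable, Runnable notifySubscribers)
    {
        boolean removedFromView = false;
        try
        {
            removedFromView = view.memtables.remove(memtable);
            notifySubscribers.run(); // may throw and fail the flush
        }
        finally
        {
            if (removedFromView)
                memtable.reclaimed = true;
        }
    }

    public static void main(String[] args)
    {
        FakeView view = new FakeView();
        FakeMemtable memtable = new FakeMemtable();
        view.memtables.add(memtable);
        try
        {
            replaceFlushedAndReclaim(view, memtable, () -> { throw new IllegalStateException("subscriber failed"); });
        }
        catch (IllegalStateException e)
        {
            // The flush still fails, but the memtable memory has been released.
            System.out.println("flush failed, reclaimed=" + memtable.reclaimed);
        }
    }
}
```

The exception still propagates to the caller, so the flush is reported as failed; only the memory accounting is decoupled from the failure.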

github-actions · 2025-02-04T16:50:22Z

Checklist before you submit for review

Make sure there is a PR in the CNDB project updating the Converged Cassandra version
Use NoSpamLogger for log lines that may appear frequently in the logs
Verify test results on Butler
Test coverage for new/modified code is > 80%
Proper code formatting
Proper title for each commit starting with the project-issue number, like CNDB-1234
Each commit has a meaningful description
Each commit is not very long and contains related changes
Renames, moves and reformatting are in distinct commits

test/unit/org/apache/cassandra/db/memtable/FlushFailingOnNotificationSubscriberTest.java

jacek-lewandowski · 2025-02-06T10:51:59Z

src/java/org/apache/cassandra/db/ColumnFamilyStore.java

-                cfs.replaceFlushed(memtable, Collections.emptyList(), Optional.empty());
-                reclaim(memtable);
-                return Collections.emptyList();
+                try


Overall, the patch seems reasonable to me. However, as you said, it is not precisely defined what should happen in case of a failure in replaceFlushed. Since we have no answer, perhaps the best way to deal with that would be to shut down the node and, at the same time, make sure that the notification consumer implementation does not throw any exceptions. Otherwise, from the CNDB point of view, is a failure in a notification consumer critical? Can we continue if it happens?

JeremiahDJordan

LGTM besides nits from Jacek

The direct cause of CNDB-12553 is that a CNDB-specific subscriber to SSTableAddingNotification throws an error, and Cassandra doesn't handle it properly. In such a case the flush is interrupted (despite multiple uses of exception-safe code and exception accumulation) after the sstable creation transaction has committed, but before the memtable has been reclaimed. As a result, the memtable allocator believes that more and more memory is in use or being reclaimed, which eventually stops writes due to an apparent lack of memory in the memtable. This patch changes memtable flushing behaviour so that the memtable is reclaimed iff it has been removed from the View, regardless of whether the flush fails or not.

sonarqubecloud · 2025-02-25T12:23:28Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
85.7% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

cassci-bot · 2025-02-25T12:27:58Z

✔️ Build ds-cassandra-pr-gate/PR-1545 approved by Butler


blambov · 2025-02-25T13:11:46Z

FYI this is one of the failure scenarios of CNDB's SSTable management that could lead to data loss.

jakubzytka · 2025-02-25T13:39:53Z

> FYI this is one of the failure scenarios of CNDB's SSTable management that could lead to data loss.

I disagree. Each storage notification consumer is run independently. Thus, an exception in one notification consumer does not affect our ability to update ETCD in another one.
The data loss scenario is when we are unable to update ETCD for other reasons.

(IMO this is not an argument for keeping the ETCD update outside of transaction handling; I just wanted to clarify that the consumers are independent)
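
To illustrate what "independent" means here, a minimal sketch follows. It assumes a generic subscriber list and is not the actual Cassandra or CNDB notification code; the class `IndependentSubscribersSketch` and the method `notifyEach` are hypothetical names. Every consumer is invoked even if an earlier one throws, and the failures are accumulated instead of aborting the loop.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

// Illustration only: hypothetical helper, not the real notification machinery.
final class IndependentSubscribersSketch
{
    /**
     * Delivers the notification to every subscriber. A failure in one subscriber
     * does not prevent delivery to the others. Returns the first failure with
     * later failures attached as suppressed, or null if nothing failed.
     */
    static <T> Throwable notifyEach(List<Consumer<T>> subscribers, T notification)
    {
        Throwable accumulated = null;
        for (Consumer<T> subscriber : subscribers)
        {
            try
            {
                subscriber.accept(notification);
            }
            catch (Throwable t)
            {
                if (accumulated == null)
                    accumulated = t;
                else if (t != accumulated) // addSuppressed rejects self-suppression
                    accumulated.addSuppressed(t);
            }
        }
        return accumulated;
    }

    public static void main(String[] args)
    {
        List<String> delivered = new ArrayList<>();
        List<Consumer<String>> subscribers = new ArrayList<>();
        subscribers.add(n -> { throw new IllegalStateException("first consumer failed"); });
        subscribers.add(delivered::add); // stands in for a consumer that must still run

        Throwable failure = notifyEach(subscribers, "sstable-added");
        System.out.println("failure=" + failure + ", delivered=" + delivered);
    }
}
```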

jakubzytka force-pushed the cndb-12553-ensure-memtable-reclaimed-when-notification-subscriber-throws branch from 337bcb0 to 67e28f6 February 5, 2025 09:53

jakubzytka requested a review from a team February 5, 2025 10:30

jacek-lewandowski self-requested a review February 5, 2025 14:36

jacek-lewandowski reviewed Feb 6, 2025

test/unit/org/apache/cassandra/db/memtable/FlushFailingOnNotificationSubscriberTest.java (outdated, resolved)

jacek-lewandowski reviewed Feb 6, 2025

test/unit/org/apache/cassandra/db/memtable/FlushFailingOnNotificationSubscriberTest.java (outdated, resolved)

jacek-lewandowski reviewed Feb 6, 2025

JeremiahDJordan approved these changes Feb 6, 2025

jakubzytka force-pushed the cndb-12553-ensure-memtable-reclaimed-when-notification-subscriber-throws branch from 67e28f6 to d262cb3 February 25, 2025 11:47

jakubzytka merged commit 8d0b97e into main Feb 25, 2025
461 of 473 checks passed

jakubzytka deleted the cndb-12553-ensure-memtable-reclaimed-when-notification-subscriber-throws branch February 25, 2025 13:41
